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Abstract. We examine necessary and sufficient conditions for posterior consis¬ 
tency under g-priors, including extensions to hierarchical and empirical Bayesian 
models. The key features of this article are that we allow the number of regres¬ 
sors to grow at the same rate as the sample size and define posterior consistency 
under the sup vector norm instead of the more conventional Euclidean norm. We 
consider in particular the empirical Bayesian model of George and Foster (2000), 
the hyper-g-prior of Liang et al. (2008), and the prior considered by Zellner and 
Siow (1980). 
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1 Introduction 

Arnold Zellner made pioneering contributions to the fields of statistics and econometrics. 
One of his works, the ( 7 -prior (Zellner, 1986), has become a cornerstone of research 
in Bayesian statistics. The g-prior specifies that a vector of regression coefficients is 
normally distributed a priori with some mean (typically zero) and covariance matrix 
equal to a scalar multiple (typically denoted by g) of the covariance matrix of the 
maximum likelihood estimator. These priors are useful for conventional hierarchical 
and empirical Bayesian analysis (Ghosh et ah, 1982) for linear regression models, but 
their application extends well beyond to variable selection (George and Foster, 2000), 
Bayesian classification of high-dimensional low-sample size data (Mallick et al., 2005), 
and many other interesting topics of research. The excellent article of Liang et al. (2008) 
provides a succinct account of mixtures of g-priors for Bayesian variable selection. 

One very important but often neglected issue in the selection of priors is to examine 
the consistency of resulting posteriors in the frequentist sense. We will provide a formal 
definition in Section 2, but in plain language, this means that as one accumulates more 
and more samples, the posterior distribution of the parameter under consideration gets 
closer and closer to its true value, eventually becoming degenerate at this point in 
the limit. Recently, the notion of posterior consistency has also been considered in 
nonparametric settings (Barron et ah, 1999; Ghosal et ah, 2000). 

In the g-prior model, if the number of regressors p does not vary with n, then 
it can easily be seen that the resulting posterior is inconsistent if g is fixed, but the 
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problem disappears when g = gn with gn ^ oo. See Section 2 for the details of these 
results. Now suppose instead that the number of regressors p = pn increases with n 
but satisfies Pn < n and p„/n —> a, where 0 < a < 1. This situation represents the 
so-called “large p, large n” regime, which has been considered in the context of model 
selection. Berger et al. (2003) provide scenarios where the Bayes factor is consistent 
but the Bayesian Information Criterion (BIC) is not, with the explanation that BIC 
may be a poor approximation to the Bayes factor when —>■ oo. Moreno et al. (2010) 
examine consistency of the Bayes factor for nested normal linear models with —>■ oo, 
including the case where Pn grows at the same rate as the sample size. Also, Jiang 
(2007) addressed the variable selection problem when > n and provided convergence 
rates for the fitted densities in a broad class of generalized linear models. 

In the context of parameter estimation as examined here, Ghosal (1999) considered 
certain types of high-dimensional linear models and provided a valuable contribution by 
proving not only posterior consistency but also asymptotic normality of the posterior 
distribution. However, our work differs from Ghosal (1999) in three principal respects. 
First, and perhaps most fundamentally, the (/-prior model itself involves an unknown 
sampling variance with an associated prior (the prior on the regression coefficients is 
taken to be conditional on a^). Such a structure is not included in the class of models 
considered in Ghosal (1999). Second, we provide necessary and sujficient conditions for 
posterior consistency in three of the four (/-prior models we consider. While we readily 
admit that stronger results such as asymptotic normality are perhaps more useful when¬ 
ever posterior consistency occurs, our necessary conditions demonstrate circumstances 
in which posterior consistency fails to occur at all, which we believe to be interesting 
in their own right. Third, our work allows the parameter space for the p„-dimensional 
vector of regression coefficients to be taken as as is natural. This contrasts with 
Ghosal (1999), which essentially requires the restriction of the parameter space to a 
sequence of compact sets. 

Bontemps (2011) also extended the work of Ghosal (1999) in several ways by permit¬ 
ting the model to be misspecified and the number of regressors to grow proportionally 
to the sample size, the latter of which is also a feature of our work. However, our work 
differs from Bontemps (2011), most notably by allowing the consideration of models 
where the sampling variance (t^ is assumed to be unknown. There are also differences in 
the assumptions. In particular, Bontemps (2011) does not make any assumption anal¬ 
ogous to the eigenvalue bounds that we will later impose in (A3). On the other hand, 
unlike Bontemps (2011), we do not make any assumptions on the asymptotic behavior 
of the true coefficient vector /3on. We must also emphasize once again that, unlike Bon¬ 
temps (2011), we provide conditions that are both necessary and sufficient for posterior 
consistency. This establishes circumstances in which posterior consistency definitively 
does not occur, which can in some cases be rather surprising (see the remarks following 
Theorem 2, for example). The recent work of Armagan et al. (2013) establishes suffi¬ 
cient conditions for posterior consistency in linear models under shrinkage priors. Again, 
the most notable difference between the models considered in Armagan et al. (2013) 
and the (/-prior based models considered in this paper is that the variance parameter 
(T^ is assumed to be known in Armagan et al. (2013). Lee and Oh (2013) consider a 
high dimensional Bayesian Principal Gomponents Analysis regression setup with > n 
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and normal priors, and examine posterior consistency (in the £ 2 -norm) and convergence 
rates under appropriate assumptions on the rank of the design matrix. 

Other authors have addressed the asymptotic properties of g-prior models, but for 
model selection instead of parameter estimation. Fernandez et al. (2001) provided both 
theoretical results and simulation-based evidence for the consistency of posterior model 
probabilities under particular choices for the g-prior hyperparameter g = gn- Liang 
et al. (2008) took a more theoretical approach and proved the consistency of posterior 
model probabilities under hierarchical and empirical Bayesian g-prior models, but only 
in the case where the dimensionality of the full model is fixed. More recently, Shang 
and Clayton (2011) provided similar results in the case where —>■ oo, albeit under a 
considerable number of assumptions. They also note that these results can be extended 
to Pn > n, the so-called “large p, small n” regime, when combined with certain dimension 
reduction approaches. See also the work of Zhang et al. (2009). 

Another new feature of our work is that we have established posterior consistency 
under the sup vector norm £oo (||®||oo = niaxi<i<p |xi|) rather than the conventional 
^2 (ll^lb = vector norm. The choice is motivated primarily because the 

£oc norm introduces added flexibility to our procedure, since it is weaker than the 
£2 norm (as a vector norm), noting that Halloo < ll®l| 2 - In particular, for proving 
consistency when the number of covariates Pn grows with the sample size, the sup norm 
approach allows to grow at a faster rate than is possible under the £2 norm. The 
simplest yet most convincing fact in this regard is the following. For the linear model 
Yn = XjiPn + Gn with i.i.d. Gaussian errors and X’^Xn = nlp„ (orthogonal covariates, 

denotes the identity matrix of dimension p„), the MLE for /3„ is consistent under 
the £2 vector norm if and only if = o(n). However, the MLE for /3„ is still consistent 
under the £cxj norm for any pn < n. See remark immediately following Lemma 1. 

As discussed above, if Pn 00 , it is harder to prove posterior consistency under the 
£2 norm as compared to the £00 norm. However, in the same vein, it is harder to prove 
posterior inconsistency under the £00 norm as compared to the £2 norm. In particular, 
any necessary condition for posterior consistency under the £00 norm is also a necessary 
condition for posterior consistency under the £2 norm. Hence, this paper also provides 
novel necessary conditions for posterior consistency under the conventional £2 norm 
(note that assumption (A2) in Section 2 subsumes the case Pn/n 0). 

The outline of the remaining sections is as follows. Section 2 provides necessary and 
sufficient conditions for posterior consistency for a nonstochastic sequence {gnyiT- > !}• 
In the process, we demonstrate the posterior consistency or inconsistency of some pop¬ 
ular recommendations regarding the choice of Section 3 provides necessary and 
sufficient conditions for posterior consistency in an empirical Bayesian context in which 
gn is estimated from the data. Section 4 provides necessary and sufficient conditions for 
posterior consistency under the hierarchical hyper-g-prior model (Liang et ah, 2008). 
Section 5 considers the celebrated Zellner-Siow prior (Zellner and Siow, 1980) and pro¬ 
vides a sufficient (though not necessary) condition for posterior consistency under this 
model. At the end of each of Sections 2-5, the interpretations and implications of the re¬ 
sults are briefly discussed. Some final remarks are made in Section 6. It should be noted 
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that although the key results of Sections 3-5 yield the same condition for posterior con¬ 
sistency, the techniques used to prove these results differ substantially among the three 
models. Furthermore, the coincidence of the conditions in Theorems 2-4 should not be 
misconstrued as a suggestion that the same condition would be shared by other hierar¬ 
chical or empirical Bayesian ( 7 -prior models. Specifically, this condition is not shared by 
Theorem 1, yet the non-hierarchical model addressed by Theorem 1 can be considered 
as a hierarchical model with a sequence of degenerate hyperpriors. Moreover, it should 
again be noted that the conditions in Theorems 1-3 are both necessary and sufficient, 
but the condition for the Zellner-Siow g-prior model provided in Theorem 4 is merely 
sufficient, and its necessity or lack thereof is not presently clear. 

2 Non-Hierarchical Model 

Consider the usual linear model Yn = XnPn+^n, with response Yn = ..., 

covariates ..., Xn,n)'^, regression coefficients /3„ = (/3n,i, ■ • ■, I3n,pn)'^ and 

errors e„ = (ei,..., e„)^. We now impose the following assumptions: 

(Al) The errors are distributed as e„ ~ 7V„(0 „,(t^I„). Here 0„ denotes the vector of 
length n with all zero entries. 

(A2) The number of regressors Pn is a nondecreasing sequence with pn < n and Pn/n ^ 
a, where 0 < a < 1 . 

(A3) The eigenvalues A„q,..., of the matrix n{X^Xn)~^ satisfy 0 < Amin < 
inf„,i Xn,i < sup„ j Xn,i < Amax < OO for SOme Amin and Amax- 

Note that (A3) implies that < n~^X^Xn < X~l^Ip^. This assumption is 

identical to assumption (A2) of Armagan et al. (2013). 

The goal in such a model is estimation of /3„. Minimal sufficiency leads to the 
reduction {(3n,Sn), where f3n = {X’^Xn)~^X^Yn, the maximum likelihood estimator 
of f3n, and Sn = ||^ — Aln/3„|||, the error sum of squares. Note that conditional on /3„ 
and (T^, 0n and S'„ are mutually independent with /3„ | /3„, cr^ ~ Np^{(3n, a‘^{X'^Xn)~^) 
and Sn I /3n,cr^ ~ a'^xl-p„- 

Now suppose priors are specified as f3n \ cr^ ^ Np^{'ym gcr^{X'^ Xn)~^) (Zellner’s 
g-prior) and ~ InverseGamma(a/2,6/2), where we permit a > —2 and 6 > 0 to 
accommodate such improper priors as 7 r((T^) oc 1/(T^,1/(T, or 1 . Suppose further that 
g = gn is specified as a known sequence of constants. This collection of likelihoods 
and priors comprises our non-hierarchical g-prior model, which we denote by Pm- One 
motivation for the use of such a model is the convenient form of the Bayes estimator 
under squared error loss. 


^'^■.= EM{(3n\^n.Sn) = 



1 


where /3„ denotes the MLE. 
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We now introduce the formal definition of posterior consistency. 

Definition. Let /3ort G for each n > 1, and let Oq > 0. Now let Pq denote the 
distribution of {{fdn, Sn),n > 1} under the model Yn = Xnfion + where e„ ^ 
iV„( 0 „, (Tg J„), for each n > 1 . The sequence of posterior distributions PMifdn I fdn, Sn) 
is said to be consistent under the £oo norm at {{f3on, < 7 g), n> 1} if Pm(| 1/3™ — /3on| |oo > 
e I /3„, Sn) 0 a.sfPo) for every e> 0. 

It should be immediately noted that the type of posterior consistency considered 
herein is fundamentally different from what could instead be considered in the analysis 
of Bayesian methodology, that is, convergence of the posterior under the same model Pm 
under which it is derived. In this case, one is assuming that the prior associated with 
the model Pm is in some sense “true.” However, this approach is perhaps too favorable 
in that posterior consistency is quite easy to achieve. In fact, in this approach, a quite 
general result due to Doob (1948) states that posterior consistency occurs on a set of 
parameter values with probability 1 under the prior associated with Pm- Instead, the 
type of posterior consistency considered herein is fundamentally frequentist in nature, 
that is, the values /3on and ctq are considered fixed but unknown. 

The frequentist properties of Bayesian methods have been of interest for some time. 
Even pure frequentists may be interested in originally Bayesian procedures, or limits and 
approximations thereof, due to considerations such as admissibility and the convenient 
elimination of nuisance parameters. Indeed, it was shown as early as Laplace (1774) 
that in simple cases, the posterior distribution and the distribution of the maximum 
likelihood estimator are comparable for large sample sizes. More sophisticated versions 
of such results have been developed in more recent times (Bernstein, 1934; Diaconis and 
Freedman, 1986; Ghosh et ah, 1982; LeCam, 1982; von Mises, 1964). 

We now provide a lemma establishing strong frequentist consistency of the MLE /3„ 
in the £oo norm. 

Lemma 1. Let Zn ~ /Vp„(0p„i^~^^)) where < n, and where the eigenvalues 
Wn,!,. .., of Vn satisfy snp^ iUJn,i = oJmux < OO. Then \\Zn\\oc ^ 0 almost surely. 

Proof (Proof of Lemma 1). First note that Var(Zn,i) = n~^Vn,ii < n~^L>j-max, and 
N{0, 1). Now let e > 0. Since 


it follows that 


n=l 


oo Pn 

n —1 2—1 
oo Pn 

- E E ^ \^mi\ > 

n—1 i—1 
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°° 1 ,3 

^ \ '' \ '' 

— 2 -^ 2 -^ £671,3 

n—1 i—1 


< 00 


by applying Markov’s inequality to ii^n i- ^^6 result follows from the Borel-Cantelli 
lemma, noting that < n. 


Observe that Lemma 1 under Pq with assumptions (Al)-(A3) and = 0n — f^on 
implies that ||/3„ — /3on||oo —t 0 a.s.(Po)- Thus, the MLE /3„ retains strong frequentist 
consistency in the ioo norm even as pn grows at a rate exactly proportional to n. To 
contrast this with the behavior of the MLE under the conventional £2 vector norm, note 
that we have the upper bound 


||/3n - /3on||i < " /3on)^ (/3n " f3on) 


and a similar lower bound with Amax replaced by Amin- Since 

M 1 


(^$n — /3on) {^n — ~ Xp„ j 


it can be immediately seen that = o{n) is required for strong frequentist consistency 
of the MLE /3„ under the £2 norm. 


In a Bayesian analysis. Lemma 1 leads to the following useful lemma, which essen¬ 
tially states that /3on may be replaced by /3„ in the definition of posterior consistency. 

Lemma 2. In the g-prior model (both hierarchical and non-hierarchical), PM{\\f3n- 
0071 I I 00 P £ I f^n, Sji’j y 0 ybr* every £ ^ 0 if and only if /^?i||oo P c | 

0n, Sn) 0 a.s.(Po) for every e > 0. 

Proof (Proof of Lemma 2). The triangle inequality implies that 
Pm /3n-$n > 2e\ ^„,Sn^ - Pm ^n-f^Or. 

(||/3« -/30n|loo > e I 

/3n ~ /3ri > 6/2 I /3n, Sn\ + Pm ( 

00 / \ 

When conditioning on fin and Sn, 

Pm (\\$n - /3on||oo > e I 0n,Sn'^ = I {\\^n “ fion 


fin - fi-. 

< Pm 

< Pm 


( 


> e 


071 0Or 


^n,Sn^ 


> £ 


> £ 


/2\^n,Sn). 


where If) denotes the usual indicator function. Lemma 1 implies that ||/3n ~/3on||oo 
0 a.s.{Po), from which it follows that /(||/3„ — /3on||oo > e) —>■ 0 a.s.{Po) for all e > 0. 
This and the above inequalities immediately yield the result. 


To establish results on posterior consistency or inconsistency in the non-hierarchical 
g-prior model, we first define Tn ■= {fin — — 7 „), so that Tnfcr'^ is the 
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usual frequentist likelihood ratio test statistic for a test of Hq : f3n = 7n vs. Ha '■ 7n 
with known variance cr^. Then the joint posterior 7r„(/3„, cr^ | (3n, Sn) is given by 


7 rn(/ 3 n, cr^ | / 3 „, Sn) OC CXp 


5 (a - A?)" ' (,3„ - 4”) 


xM 


2 \ -(n+p„+a )/2 


exp 


1 


•S'™ + 6 + 


<?n + 1 


and integrating out /3„ from this yields the marginal posterior of cr^, 


/ 2 I /3 c* \ ( 2^ (^~t”^i)/2 

TTnlO- I Pn,Sn) OC (tT j exp 


1 


S'ra + & + 




5n + 1 


i.e., cr^ I /In, S'™ ~ InverseGamma((n + a — 2)/2, Tnl2), where we dehne T„ := ^n + & + 
{gn + For notational convenience, for each n > 1, define 

5; __ ?^ll7n -/3on||l _ 

' (7n - /3on)^XjXn(7n ” M ’ 

^On E{){Tn) = Pn<^o ^-^On 117?^ /3on| I 2 5 

^On -£'o(^n) = (77. ^ ^ I l 7 n /^Onl I2I ; 

and note that Amin ^ Aon ^ Amax since Amax-^Pn — ^ “^n Si ^min^Pn. ■ 

The following lemmas establish the behavior of various quantities under Pq, and 
they will be heavily used in proving posterior consistency or inconsistency in both the 
non-hierarchical and hierarchical g-prior models. The proof of each lemma can be found 
in the Appendix. 

Lemma 3. (n — pn)~^5'„ —>■ ctq a.s.(Po)- 

Lemma 4. //a > 0 or liminfn-).oo ||7n —/Sonlli > 0; ^n/^On—>• 1 a.s.{Po). 
Lemma 5. Tn/O^n —>■ 1 a.s.(Po)- 


The following lemmas regarding the normal distribution will be useful in establishing 
the condition for posterior consistency in the non-hierarchical case. The proofs are 
provided in the Appendix. 

Lemma 6. Let Zn A^p„(/in, 5]„), S„ positive definite, n > 1. If ||/i„ — ^n|loo 0; 
then there exist e > 0 and a subsequence fc„ ofn such that P{\\Zk,, — ^fe„|loo > s) ^ 1/2 
for all n. 

Lemma 7. Let Z ^ N{p,T^). Then P{\Z\ < < 1 — 2 ^{—£^/t) for every > 0, where 

$ is the standard normal cdf. 

Lemma 8. Let Zn ^ ^pni^pn^ for eaeh n > 1, where Yin has each diagonal entry 
equal to 1 and eigenvalues a;„p,..., . //infn,i Wny = Wmin, then infn,i yar(Zi | 

Yi+l, . . . , Zp^) ^ COmin- 
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Finally, one additional lemma provides a key result about the marginal posterior 
of cr^. Again, the proof is deferred to the Appendix. 

Lemma 9^ In the non-hierarchical g-prior model, the posterior distribution of sat¬ 
isfies PM{0on/2n < cr^ < 20onln \ /3„,S'„) -)■ 1 a.s.(Po)- 

Note that although does not appear explicitly in the result in Lemma 9, thejesult 
nevertheless does depend on the choice of since it is involved in the quantity don- 

We now state and prove the necessary and sufficient condition for posterior consis¬ 
tency in the non-hierarchical g-prior model. 

Theorem 1. In the non-hierarchical g-prior model Pm, posterior consistency occurs if 
and only if both l)“^||7„-/3on||oo 0 andg„(g„-f l)“ 2 (logp„)n“i|| 7 „-/ 3 o „||2 

0 . 

The proof of this theorem in provided in the Appendix. 

2.1 Interpretations and Implications 

In the same vein as frequentist consistency, posterior consistency can be conceptualized 
as the idea that the center (not necessarily the mean) of the posterior distribution con¬ 
verges to the true value while the spread (not necessarily the variance) of the posterior 
distribution converges to zero. In light of this, it is noteworthy that the two conditions 
in Theorem 1 arise from precisely such considerations. The first condition controls the 
convergence to zero of the £oo-distance between the posterior’s center and the true value 
/3on; while the second condition controls the convergence of the posterior’s spread to 
zero. Both conditions are necessary for posterior consistency to hold. 

In the simple case where Pn does not increase with n, it is typical to fix the prior 
mean as 7 ^ = 7 and to assume that f3on = f^o also does not vary with n. In this case it 
can be immediately seen that although the second condition of Theorem 1 is satisfied, 
the first condition fails except in the serendipitous case that 7 = /3o. Of course, the 
result is somewhat obvious even without appealing to Theorem 1, since the posterior 
mean is simply a weighted average of the MLE /3„, which is strongly consistent for (3q, 
and the prior mean 7 with weights g{g + i)~^ and (g -I- 1)“^. In this case, the situation 
may be remedied by taking any choice of g„ that tends to infinity. For instance, the unit 
information prior (Kass and Wasserman, 1995) is equivalent to taking g„ = n, while 
gn = max{n,p^} has also been recommended (Fernandez et ah, 2001). Either choice 
yields posterior consistency in the fixed-p case. 

The result of Theorem 1 becomes more interesting when p„ —>■ 00 . Suppose that 
\hn - PonWoo = 0(1), but || 7 „ - PonWl = 0(p„). This can happen, for example, if (a) 
7n = Pn, or (b) the entries of /3on are uniformly bounded and 7 „ = c/3„ where 0 < c < 1 
(follows immediately from Lemma 1). In this case, the first condition is satisfied as long 
as g„ —?► 00 , but the second condition imposes the additional requirement that gn must 
grow faster thanp„n“^ logp„- The aforementioned choices oign = n or g„ = max{n,p^} 
provide posterior consistency in this case as well. 
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As another special case, suppose pn = 0{n) exactly, but suppose only a finite number 
m > 0 of components of 7 n — /3on are nonzero and these m components remain fixed as 
n grows. This circumstance could arise with the logical choice if only the first 

few covariates are present in the “true” frequentist model Pq, but covariates continue to 
be added as the sample size increases. Then any ^ oo ensures posterior consistency. 
This case is admittedly uninteresting in the non-hierarchical model, but we will revisit 
its behavior later under empirical and hierarchical Bayesian models. 


3 Empirical Bayesian Model 

A popular approach is to avoid specifying g or altogether by the use of an empirical 
Bayes method (George and Foster, 2000) in which the value of g is estimated from the 
data. The most common technique is to use the value of g that maximizes its marginal 
likelihood, restricted to g > 0. By integrating out /3„ and from the joint distribution 
of /3„, , /3„, ( 7 ^, the marginal likelihood of g is found to be 




for which the maximizing value of g subject to g > 0 is 


5 ™ := max { 0 , 


n — Pn + a — 2 
Sn + b 


Pn) 



We first provide a lemma (proven in the Appendix) that addresses the behavior of 5 ™ 
Lemma 10. //lim inf „_^oo ||7n -/3on||i > 0, then lim inf„^oo > 0 a.s.{Pa). 


Since is simply a function of {jdn, Sn), the empirical Bayes posterior is identical 
to the simple non-hierarchical Bayes posterior, but with the data-dependent quantity 
g^ in place of (/„. Thus, while Theorem 1 would allow us to immediately state a nec¬ 
essary and sufficient condition for posterior consistency in terms of g™, an alternative 
condition not involving data-dependent quantities would be preferable. The following 
result gives precisely such a condition and establishes its necessity and sufficiency. 

Theorem 2. In the empirical Bayes g-prior model, posterior consistency occurs if and 
only if either a = 0 or there does not exist a subsequence kn of n and a constant A > 0 
such that || 7 fe^ - fiok„\\l -t A and || 7 fc„ -/3ofe„||oo 0. 


The proof of this theorem is provided in the Appendix. 


3.1 Interpretations and Implications 

It should be noted that there is no immediately obvious remedy for inconsistency in 
an empirical Bayesian 5 -prior model due to the failure of the conditions in Theorem 2. 
For any particular non-hierarchical 5 -prior model. Theorem 1 implies that there always 
exists a choice of 5 „ growing sufficiently fast to ensure posterior consistency (although 
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the choice may depend on /3on)- However, such options are not available in the empirical 
Bayes approach, since g is selected via a specified function of the data. 

Another salient consequence of Theorem 2 is that if = o(n), then the empirical 
Bayes model exhibits posterior consistency for all values of 7 „ and /3o„. However, if 
Pn = 0(n) exactly, then the situation is not as simple. For example, if 7 „ = Op^ 
for every n and lim„_).oo H/SonlH = OO; then \\yk„ ~ /^ofcnlli converges to oo for every 
subsequence kn, which implies that posterior consistency occurs. Similarly, if 7 „ = /3„ 
for every n, then by Lemma 1, || 7 fc„ — /3ofc„ ||oo converges to zero for every subsequence A:„, 
which implies that posterior consistency occurs. On the other hand, suppose that only 
a fixed number p* > 0 of components of 7 „ — /3o„ are nonzero and these p* components 
remain fixed as n grows. Then clearly both ||7ra — /9on||oo and ||7n — /SonJIi converge to 
constants, so the condition of Theorem 2 fails, and the posterior is inconsistent. 

This behavior is perhaps somewhat surprising. If the prior mean 7 „ is imagined as 
a guess for the true /3on, then one might speculate that posterior inconsistency would 
only occur when the guess is quite bad, i.e., when ||7n — /^onlli or ||7n ~ /3on||oo grows 
too quickly. However, in the empirical Bayesian setting. Theorem 2 shows that this is 
not the case. Intuitively, the reason is that if we allow the data to determine the value 
of g, then a prior mean 7 „ that is “too close” to f3on (in the £2 sense) may cause the 
data to choose g values that tend to a finite constant, rather than to infinity, which 
leads to posterior inconsistency. An open question regarding this behavior is whether 
this interesting behavior is in some way dependent on the Gaussian tails imposed by 
the p-prior model. However, the derivation of a similar condition for a hierarchical g- 
prior model considered later in Theorem 3 casts doubt on this possibility, since the 
hierarchical model simply corresponds to some marginal prior with heavier tails. 


4 Hyper-gf-Prior Hierarchical Model 


An alternative approach to the specification of g is a hierarchical model in which g is 
considered a hyperparameter and is given a hyperprior 7r„(g). Under this model, the 
joint posterior is given by 


(^n; ^ T 9 \ 0n: ) 



where $^{ 9 ) ■= \ g,a'^,$n,Sn) = ff(5 + 1) + (5 + 1) Wn- Integrating out (3n 

and subsequently yields the marginal posteriors 


{a^,g I $n,Sn) oc (cr 



(g + l) 


( 1 ) 
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/ T \ 

TTnig \ $n,Sn) OC + \Sn + b+ 7r„(g). (2) 

The following technical lemma, which is proven in the Appendix, establishes a relation¬ 
ship between posterior consistency in the hierarchical g-prior model and the convergence 
of a particular sequence of posterior probabilities. Note that the lemma makes no as¬ 
sumptions on the particular form of the hyperprior TTn{g). 

Lemma 11 . In a hierarchical g-prior model, suppose that n~^ EM[g'^{g + 1 )”^ | 

(dn,Sn] —>-0 a.s.{Po). Then posterior consistency occurs if and only if PM[{g+^)~^\\'yn — 

fdon 11 oo > e I /3„, Sn] 0 a.s.{Po) for every e > 0. 


The form of the marginal posterior of g in (2) suggests that a convenient choice of 
hyperprior is 7r„(g) oc (g -|- for some constant c, called the hyper-g-prior (Liang 

et ah, 2008). This prior is proper for c > 2, and there exists an argument (Liang et ah, 
2008) for taking 2 < c < 4, but we instead permit c to take any real value in the present 
analysis. The hyper-f^-prior yields the posterior 

/ p \ —("+a—2)/2 

TTuig I 0u, Sn) oe{g + l)-(P"+'=)/2 f ^ j 

oc (5 + l)("-P"+“—2)/2 [(g + +b)+ Tn] (3) 


It will also be useful to define the transformation 

_ (g + l)(* 5 'n + b) _ Sn + b 

{g + l)(<S'n + 6) + Tn Sn + b + Tn 


(4) 


so that 5 > 0 if and only if u > Wn- The next lemma asserts that Lemma 11 applies 
with this choice of hyperprior. The proof can be found in the Appendix. 

Lemma 12. With the hyper-g-prior, n~^ E {g + 1 )~^ | — >■ 0 a.s.(Po)- 


To examine the behavior of the posterior probabilities in Lemma 11 under the hyper- 
( 7 -prior, we begin by using the posterior in (3) to write 


Pm 


1 


Ag+ 1 

= Pm 


\Yyn /3on 


1 


> e 


/3n j Sn 


g < - I l7n - /3ori 


- 1 


fdnt Sn 


/ (g + l)in-p„+a-c-2)/2 ^ ^ ^ 

Jo _ 

oOO 

/ {g + l)(n-P.+a-c-2)/2 ^ 

Jo 


— (n+a —2)/2 


dg 


— (n+o—2)/2 


dg 


where we define ( 7 „(e) := max{0, e ^|| 7 n — /3on||oo — !}• Now define 


Ln{S) 


Win f^OnWoc (Sn P b) 


£ ^ Win ~ fdonW^ (Sn P b) Tn 


Ln(e) := 


) := max |Wn,Ln(e)| 
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and apply the transformation in (4) to obtain 

1 


P, 


M 


W'^n /3on|loo P ^ 


/3n 5 


5 + 1. 

^ y(n-p„+a-c-2)/2('^ _ ^^(p„+c-4)/2 




{n-pn+a-c-2)/2 /-I _ 


(1 - du 


’Wn 


PmWu < Un<L^{e) I 0u,S„ 

PM{Un > Wn I /3n,Sn) 


(5) 


where Un ~ Beta((n — p„ + a — c)/2, (p„ + c — 2)/2) and is independent of /3„ and S'„ 
under Pm- Note that by the properties of the beta distribution, U„ 1 — a a.s.(Po), 
and PniUn > Wn \ 0m Sn) > 0 for all n a.s.(Po) since Wn < 1 for all n a.s.(Po)- We 
now introduce several technical results regarding these quantities that will be useful in 
proving the main theorem. The proofs are deferred to the Appendix. 

Lemma 13. //liminfn_>oo ||7n — (donll^ + ^ for some (5 > 0, then limsup^^o^ Wn ^ 
(1 rr)Amax^o/(^ + Amax^^o) +1 ^ tt.S.(Po)- 

Lemma 14. If ||7„ — /9on|l2 OO: then (i) Wn —>■ 0 a.s.(Po)) ond also (ii) Ln{e) —?► 0 
a.s.(P q) for every e > 0. 

Lemma 15. //liminfn^oo ||7n - /3on|loo > 0 \hn - f^OnWl A, where 0 < A < 

oo, then (i) for every e > 0, there exists L*(e) < 1 such that limsup„_jg,3 P„(e) < 
P*(e) a.s. (Pq), and (ii) for every C< 1; there exists > 0 such that 

lim inf P„ (ec) > C a.s.(Po). 

n—^oo 


To prove our main result, we will also need the following lemma, which provides a 
simple result about beta random variables, the proof of which is in the Appendix. 

Lemma 16. Let Zn Beta{an,bn) for n > 1, where On/n 1 — a and bn/n —>• a, 
with 0 < a < 1. Then P(1 — a — e<Z„<l — a + e)— for every e > 0. 

We may now state and prove the main result, a necessary and sufficient condition 
for posterior consistency in the hyper-^-prior hierarchical model. Interestingly, this con¬ 
dition is identical to the one given in Theorem 2 for the empirical Bayesian model. 

Theorem 3. In the g-prior model with the hyper-g-prior, posterior consistency occurs 
if and only if either a = 0 or there does not exist a subsequence of n and a constant 
A > 0 such that ||7fe„ - f3okJ\l A and ||7fc„ -/3ofc„||oo 0. 


The proof of this theorem is provided in the Appendix. 
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4.1 Interpretations and Implications 

It should not be entirely surprising that the empirical Bayesian and hyper-g-prior hierar¬ 
chical models share the same necessary and sufficient condition for posterior consistency. 
Indeed, the choice c = 0 yields the Uniform(0, oo) hyperprior on g, and in this case the 
marginal posterior and likelihood of g coincide. More generally, we should expect an 
adequately well-behaved hierarchical model to exhibit broadly similar behavior to the 
empirical Bayesian model, since both models essentially permit the data to determine 
the value of g. 


5 Zellner-Siow Hierarchical Model 

Another popular choice for the hyperprior T^nig) is g ~ InverseGamma(l/2, nl2), called 
the Zellner-Siow hyperprior (Zellner and Siow, 1980). The motivation behind this choice 
is clearest when X'^Xn = nip,,, in which case it leads to marginal Cauchy priors 
for each component of /3„. In this section, we will provide a sufficient condition for 
posterior consistency with the Zellner-Siow hyperprior. It still remains an open problem 
to determine if the condition is also necessary. 

For general X'^Xn, the Zellner-Siow hyperprior yields the posterior 

/ rji \ —(n+a—2)/2 

T^nig I $n, Sn) Oi {g + f 5,.,^ + fo + 7+T j 

OC (g + 1)(—Pn+a-2)/2 ^ ^ ^ -("+a-2)/2^_3/2 

X exp , (6) 

We begin with a lemma showing that Lemma 11 applies in this model. The proof is 
deferred to the Appendix. 

Lemma 17. With the Zellner-Siow hyperprior, n~^ EM[g^{g + 1)”"^ | 0n,Sn] —t 0 
a.s.{Po). 


-3/2 


exp - 


25 


Now consider the form of the posterior probabilities in Lemma 11 under this hyper¬ 
prior. By once again making the transformation in (4), we may write 


M 


5 + 1 


IITu. /^On 


> e 


fdn-) Srj 



y^{n-p„-ea-2)/2^^ _ y^(p„-4)/2 

{[u-Wn]l[Wr,{l-u)]f/^ 


nWn{l - u) 
2{u-Wn) 


du 


^(n-p„+a-2)/2f^ _ y\(p„-4)/2 

^exp 


nWn{l - u) 

2(u - Wn) 


w„ {[u-Wn]/[Wn{l-u)]y 


du 
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/ fn{u) 


-3/2 r 

exp 

1 — U 

'Wn 

. 



U-Wn 


1 — U 


-3/2 


exp 


nWn{l - u) 
2{u - Wn) 


du 


nWn{i - u) 
2{u - Wn) 


du 


(7) 


where /„ is the density of a Beta[(n — Pn + o)l2, (jpn — 2)/2] random variable with 
respect to Lebesgue measure. The following lemma (proven in the Appendix) addresses 
the lower tail probabilities of such a sequence. 

Lemma 18. Let Zn ~ Beta(a„, bn) for n> 1, where an/n —>■ 1 — a and bn/n —>■ a, with 
0 < a < 1, and let > 0. Then (i) P{Zn "£ f) for all sufficiently large n if 

a > 0, and (ii) P{Zn < 0 ^ sufficiently large n if a = 0. 


Note that the bound provided by Lemma 18 in the case where 0 < a < 1 is only 
useful if < 1/4. Now let Qn{e) and i?„ denote the numerator and denominator, re¬ 
spectively, of (7). The following lemmas establish some results regarding these quantities 
that will effectively provide the proof of the main theorem. Their proofs are provided 
in the Appendix. 

Lemma 19. //liminf„_>.oo Win — /Sonlll > then there exists a finite constant K such 
that Rn > exp(—nAT) for all sufficiently large n a.s.{Po). 

Lemma 20. //||7n —/Sonlli oo, then there exists a sequence of constants Kn{e) oo 
such that Qn{() < exp [—nK„(e)] for all sufficiently large n a.s.(Po). 

Lemma 21. If Win - /3on||i A > 0, liminf„_>oo Win - PonWoo > 0, and a = 0, then 
Qn{f)/Rn 0 a.sfPo) for every e > 0. 


We may now state the main theorem, which establishes the same sufficient condi¬ 
tion for posterior consistency under the Zellner-Siow hyperprior as for the conjugate 
hyperprior and empirical Bayes models of the previous sections. However, unlike The¬ 
orems 2 and 3, it does not establish the necessity of the condition, which remains an 
open question. 

Theorem 4. In the g-prior model with the Zellner-Siow hyperprior, posterior consis¬ 
tency occurs if either a = Q or there does not exist a subsequence kn of n and a constant 
A > 0 such that Wlk„ - PokJll A and || 7 fe„ - /3ofe„||oo 0. 

The proof of this theorem is provided in the Appendix. 


5.1 Interpretations and Implications 

Since the same condition is sufficient for posterior consistency under both the hyper-g- 
prior and Zellner-Siow hierarchical models, one might wonder if this condition is suffi¬ 
cient for posterior consistency under every hierarchical model. However, the falsehood 
of such a claim is made clear by the observation that the non-hierarchical model, for 
which the sufficient condition differs, is simply a special case of the hierarchical model 
in which the hyperprior 7r„ is specified to be degenerate at gn- In actuality, the posterior 
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consistency or inconsistency of hierarchical models with other hyperpriors on g remains 
a topic for future consideration. 


6 Summary 

We have derived conditions for posterior consistency under g-priors by defining posterior 
consistency under the £oo vector norm, which allows useful results to be obtained even 
when the number of parameters p = grows in proportion to the sample size n. Using 
this dehnition, we have obtained conditions for posterior consistency under a variety of 
g-prior models. First, we have obtained a necessary and sufficient condition for posterior 
consistency in the non-hierarchical model in which g = pn is specified as a series of con¬ 
stants. Additionally, we have derived a necessary and sufficient condition for posterior 
consistency under both the empirical Bayesian g-prior model (George and Foster, 2000) 
and the hyper-g-prior model (Liang et ah, 2008). Interestingly, we have found that the 
condition is the same for both models, and we have illustrated that the necessity of 
the condition proves posterior inconsistency in a somewhat surprising scenario. Finally, 
we have shown that this same condition is sufficient for posterior consistency in the 
Zellner-Siow (/-prior model (Zellner and Slow, 1980), but the condition’s necessity or 
lack thereof remains an open question for future consideration. 


Appendix: Proofs 


Proof (Proof of Lemma 3). Under Pq, the expectation and fourth central moment of 
Sn are Eo{Sn) = in-pn)(JQ and {p 4 )o{Sn) = l2{n-pn){n-pn+'i)crQ. Let e > 0. Then 




Sn 


n-pn 




< 



E 


n - Pn + 4 

{n-pnY 


< oo. 


so {n — Pn) ^Sn —t ctq a.sfPo) by the Borel-Cantelli lemma. 

Proof (Proof of Lemma 4). Note that under Pq, Tnjo-Q has a noncentral chi-square 
distribution with pn degrees of freedom and noncentrality parameter —/Jonlli- 

Then the fourth central moment of Tn under Pg is 


Eq 

= 12a, 


'M)o (Tn) 

(Tn-Oon)'^ =£'o| T’n- (pnO-Q+^Vr! Il7n-/30«||2) | 

0 {pnfyl + 2nA(C„^ ||7„ - l3on\\t] + 48crg (pnal + 4nAo„^ ||7„ - 


^Pn'^o ffi 2n,AQ„ II7, 

^2p„(Tg -|- 2nAg„ ||7n — /3on|l2 

= iSaXn + 192(T®0On. 


< 12(7o 


- f^OnWt) + 48crg [pnal + 4nAo„^ ||7„ - (donWt) 
'ISa^ (4pna^ + AnX^n Win - l3on\\l'^ 


( 8 ) 


Define S := liminfji_^oo Win — /Sonlli- Observe that if a > 0, then 0on > PnO'o > 
an(To/2 for all sufficiently large n, and so If 6 > 0, then 0Qn > 
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117^1 ~ f^ 0 n \\2 > ^•^max'^/2 foT all Sufficiently large n, and so Oq^ = Oin ^). Either 
way, = 0(n~^), so the fourth central moment ofTn/O^n under Pq is 


, , fTr,\ 192cr6 

Mo ^ <^ + ^ = 0(n-2)- 

V ) Oqu ^On 


Then for any e > 0, 

OO 

E^o 


n—l 


■ - 1 


POn 


\ °° 1 
>e) <E;4 (/"4)o 


n—l 


T 

n 

Oon 


< OO, 


which implies that Tn/Oon 1 a.s.{Po) by the Borel-Cantelli lemma. 

Proof (Proof of Lemma 5). It follows from (8) that the fourth central moment of Tn 
under Pq is 




■ / _ _ \ 4' 

(t„) 

II 

-1 

o 

1 

E 
_ 1 


— En 


Q ( \ 2 . '^n Pn^^o E IT'^On Win /3on|l2 

PnJ^O ' 


<?n + 1 


5n + 1 


< 


8L;o{[5„-L;o(5„)]^} + 8£:o 


Tr, 


<?n + 1 

12(7,4 


— Eq 


n 


5n + 1 


= 96(n - p„)(n - + 4)cr® + (pncrl + 2nAo„i Win - MHa) 

48^6 / ^ \ 

+ +T)4 V ”'^0 + 4nAo„ Win — /3on||2j 

< 96(n -Pn + 4)^(7® + 48(7o0o„ + 192(7^0on- 
Since Oqu > (n — Pn)o'o, the fourth central moment ofTn/Oon under Pq is 
' fr, \ 


(/44) 


0 


aon 


< 


48 


192 


96(n - + 4 )^( 7 ^ 48(7p 192(7^ 96(n - + 4)^ 

“ (n-p„)4 (n - p „)2 (n - p„)3 ’ 


0L 


which is 0{n ^). Then for any e > 0, 

f. 


E^o 


■ - 1 


On 


> 


^UE4(M4)e 


n—l 




C'On 


< OO, 


which implies that T„/0on 1 a-S-(Po) by the Borel-Cantelli lemma. 

Proof (Proof of Lemma 6). Assume ||/i.„ — ^n|loo ^ Then there exists a subsequence 
kn of n and a S > 0 such that ||/i,fc„ — ^fc„||oo > ^ for all n. There also exists an in, 
1 < in 1 Pn, such that \pkr,,in “^fcn.inl = ll/4fcn ~€fe„||oo ^ S for all n. Then either 
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pkn,in < (Case 1) or pkn,in > ^k„,i„ +(5 (Case 2). Now let 0 < e < S, and note 

that P(||Zfe„ -4fc„|loo ^ ^ P ((.kr,,i„ - S < Zk„,i„ < Cfen,i„ +'^)- Recall that is 

assumed positive definite. Then in Case 1, 

P (Cfcr..in — 5 < + <5) < P {Pk„,i„ < Zk„,i„) = 1/2, 

while in Case 2, 

P {ik„,ir, — 5 < < fk„,i„ + S) < P {Zk„^i„ < Pkr,,in) = 1/2- 

Either way, P{\\Zk„ — ^fc„|loo > e) ^ 1/2 for all n. 

Proof (Proof of Lemma 7). Note that for any t > 0, ^{z + t) — $(2 — t) is maximized 
at z = t). Henee, 

P{\z\ <0 = P(-^ <Z<()=Pl cidJf < ^ ^ 




< 4> ( $ (-- 


N J \ 'c j 

from which it immediately follows that P(|-^| < 0 ^ 1 ~ 2 $(—//t). 
Proof (Proof of Lemma 8). For each i = 1,... ,p„, partition S„ as 





^n,2,12 

Sn — 

n.i.li 

T 

^n,ii 

^n,2,2z 


y«T 

.^n,z,12 


Sn,i,22_ 


where the submatrices Tin^ipi and Sn,i,22 along the diagonal have dimension (z — 1) x 
(i — 1) and {pn — i)x (p„ — i), respectively. Then define T,n,i ■= Var{Zi \ Zi+i,..., Zp^), 
so that = Tin^ii — 22^n,i,2i- Notc that is the first diagonal entry of 


T 

n,2,22 


^n,2,2z 
^n,2,22 


n -1 


which has eigenvalues bounded above by since the eigenvalues of a principal subma¬ 
trix are bounded below by the smallest eigenvalue of the full matrix. Hence 'E~\ < 
and the result immediately follows. 

Proof (Proof of Lemma 9). Recall that T„/0on —^ 1 a.s.(Po) by Lemma 5. Then for all 
sufficiently large n. 


\ 2n n 


fdn ; Sn 


> Pm 


3T„ 


4(n + 0 — 4) 


< o-^ < 


STn 


4(n + 0 — 4) 


$n,Sn] a.s.{Po) 
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= Pm 

a^-E 

>1- 

( 4(n + a 

V t; 

- 1 

32 


Em I 0n,Sn'j 


< 


T 

n. 


4(n + a — 4) 

2 r 2 


f^n 5 


(n + a — 4)2(n + a — 6) 


n + a — 6 


where the last inequality is a consequence of Chebysheyfs inequality, for which we note 
that VarM{cr'^ \ 0n, Sn) = 2(n + a — 4)“^(n + a — 

Proof (Proof of Theorem 1). By Lemma 2, we may replace /3o„ with f3n in the definition 
of posterior consistency. We will now consider four cases. 

Case 1: Suppose (g„ + l)“^||7n - /3on||oo ^ 0. Then since || 7 „ - /3„||oo > Win - 
/3on||oo - ||/3n - /3on||oo and W$n “ /3on 11 oo 0 a.s.(Po) by Lemma 1, it follows that 
{pn + l)“^|| 7 n — ,9„||oo ^ 0 a.s.{Po). Now observe that under Pm, 


/3n 0n I O' , , Sn 



Then by Lemma 6, there exists an e > 0 and a subsequence kn of n such that, a.s.{Po), 
-PM(||/3fe„ - /3fc„||oo > e I cr^/3fc„,-5'fc„) > 1/2 for every n and every cr^ > 0. Then 


Pm (||/3fc„ —/3fe„||oo > e | /3fc„,S'fe„^ 


= Em 

> 1/2 


Pm (||/3fc„ -/3fc„||oo > e | cr'^, Sk„'^ 

for every n a.s.{Po). 


(^k„,Sk„ 


Therefore PM^Wf^n — /3n||oo > e | (3n, Sn) 0, so posterior consistency does not occur. 

For the remaining cases, suppose (pn + l)~^|| 7 n — /3on||oo —0. Then since \\in — 
(Sn 11 oo ^ I \ ln /^on 11 oo 4“ | |/3n /3on 11 oo and | |/3n /3on 11 oo t 0 a.s. (Pq ) by Lemma 1 , it 
follows that (pn + 1)“^||7„ — (SnWoo 0 a.s.{Po). Then 

Pm (||/3n - $nWoc, > 2 e | - Pm (||/3n -/3n||oo > e | $n,Sn'^ 

< Pm {\\(Sn - /3n||oo > e | $n,Sn^ 

< Pm (||/3n - /3^||oo > e /2 | /§„, + Pm {\\$n - /3n||oo > e /2 | /§„, 

by the triangle inequality. Note that 

Pm (||/3n-/3n||oo > e | /3n,S'„) =/(||/3^ -/3„||oo > e), 

where /(•) denotes the indicator function. But 0^ — /3„ = {pn + 1 )“^( 7 „ — /3„), so this 
indicator is zero for all sufficiently large n a.s.{Po). Therefore, posterior consistency 
occurs in Cases 2-3 below if and only if Pm i\\0n - $n\\oo > e I 0n,Sn) 0 a.s.(Po) 
for every e > 0. We now consider the individual cases. 
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Case 2: Suppose that {gn + 1) ^||7n — /9on||. 
l)“2(logp„)n“^||7„ - fdonWl 0. Observe that 


0, and also suppose that gn{gn 


P, 


M (||/3n - /3^||oo > e I $n,Sn^ 


= Em 
< Em 


Pm 

Pm 


+ Pm cr > 


‘20on 


> e I a^,0n,Sr 


fSn ; Sj, 


> e cr- 


fdn ; Sfi 


\$n,Sr0j 


I \ < 


26, 


On 


f^n 1 Sn 


We immediately have that Pm{<^^ > ‘^'donln \ 0n,Sn) —>■ 0 a.s.(Po) by Lemma 9, 
so it sujfices to work with the first term to establish posterior consistency. Let Vn,ij 
denote the ijth element of n{X^Xn)~^, and note specifically that the diagonal el¬ 
ements may be bounded by Amin < Vn.u < Amax for all n and i. Also recall that 
f3n- 0n \ cr^,$n,Sn ~ -A^p„(Op„, 5n(ffn + 1)“(XJX„)) Under Pm- Now let e > 0, 
and bound the aforementioned first term by 


Em 


Pi 


M 


f3n- 0n 

- 

- ^ 

_ i^l 

<Em ^ 2 $(- 

.i=l V 

<2pnEM 4>l-. 


> e cr' 




/ < 


20n 


[dn 1 Sji 


Pn,i - Pn,i 


0n, Sn 


M ^ J 



€^{grid-l)n 
y 9n'Cn,iiO'‘^ ^ 

^e^(5n + 


L\a^ < 


[djit Sji 


26, 


On 


fdn ; Sn 


= 2p„$ 




2ff nAmax^On J J V V 25 n Amax^On 

where 4>(-) denotes the standard normal cdf. Then by the Mills ratio, 


2pn^ 


i e^{gn + l)n^ 

25n Amax^On 


< 2p„ 


' Sn^mSLK^On 


exp 


e^(5n + l)n^ 

'^^rr'^max^On 


This expression clearly tends to zero if6on/n is bounded above, so we may instead assume 
that 5ori/ci —^ oo, which by inspection occurs if and only if (gn + 1)~^| |7n ~ /3ori| li c»- 
Then 0on < 2nA^„'^(gn + l)~^||7n — /^Onlli for all sufficiently large n, and hence 


2p„4> -1 


e^(gn + 

25 n Amax^On 


< 2p„ 


max Qn \hn - (dpnW 
7rAo„e2(gn + 1)2 n 


exp 


Aon£^(5n + 1)^ n 

8A 

max Qn Win - (don\ 
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< 


max Qn Win - /3on||2 logPn 
7rAo„e2(g„ + 1)2 n 


X exp 


1 - 


8A max Qn Win - /3on||2 logPri 
0 for every e > 0 


logPn 


by the assumption that gn{gn + 1) ^(logPn)?T. ^Wln — /^Onlli 0- Therefore, posterior 
consistency occurs. 

Case 3: Suppose + l)“^||7ra — /9on||oo 0, but now suppose that gn{gn + 
l)“^(logp„)n“^||7„ — /Sonlli 0- Then there exist a subsequence kn of n and a con¬ 
stant A > 0 such that gkn{gk„ + l)~^(logPfe„)A:“^||7fe„ — /Sofenlli > ^ Note 

that posterior inconsistency of the subsequence PM{f^k„ \ (3k„,Sk„) implies posterior 
inconsistency of the overall sequence PM(,(Sn \ Pn,Sn), so we may assume without loss 
of generality that kn = n for notational convenience. Also, define S„ to be the Pn x p„ 
matrix with elements '■= Vn,ij/where Vn,ij denotes the ijth element of 
n {xlx n) ^ as before. Then 


M (||/3« -/3f||oo > e I $n,Sn^ 


P. 


> Em 


> Em 


Pm ( 


f^u-Pn 


> e I a^,Pn,Sn] I \(r > 


0, 


On 




Pm max 

l<i<Pn 


- Pu,i 


> We 


2n 

a\Pn,Sn]l{a^>^-^ 


don 


2n 


Pn ; Sn 


Then we may write 


P, 


M 


{WPn-P 

> Em 


1 e I Pm Sr, 


Pm max \Zi\ > 

\ l<i<p„ 


(ffn + l)e^n 
Pn Aniine^^ 


I\a^> 


OPn 

2n 


Pn 5 Sn 


where Zn ~ Np,^(Op,^,'Sn) and is independent of cr^ under Pm. Now note that the 
innermost conditional probability is a nondecreasing function of , which implies that 


Pm {WPn - /3f ||oo > e I Pn,Sr)j 


> Em 


m 


max \Zi\> 

l<i<Pn 


l 2{gn + l)e^n^ 
dn^min^On 


aAl a^> 


= P, 


M 


max \Zi\ > 

i<i<p„ 


/2 (g„ + l)e^n2 
Pn Aniin^On 


Pm\(^> 


2n 


2n 
Pn 1 Sn I , 


Pn 7 Sn 


since the entries of depend only on Xf^Xn. Then Lemma 9 immediately implies 
that PMick"^ P I Pn,Sn) —>■ 1 a.s.{Po), so it suffices to show that the first 
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term is bounded away from zero for all sufficiently large n. Now define rjon '■= [2(gn + 
1)6 n /^n'^min^On] ^ and ^n,i ■— Vari^Z^ I Zi^i, - ■ •; ^Pn)' Then 


/ \ 


Pn 

( max \Zi \ < 77on ) = Em 

\l<i<p„ J 

Pm {\Zi \ < pon Z 2 , Z-i,. 

■ ■ ^ ^Pr.)W_h\Zi\<r|0,^} 
2=2 


/ /-\ ■ 


' Pn 

1 -2$ f -?70n/YS„p j 

Em 

'[{h\zA<vo^} 

.2=2 


by Lemma 1 and the fact that does not depend on Z 2 , Z^,..., Zp^. By repeated 
conditioning on Zi^i, Zi^ 2 , ■ • ■, Zp^ for z = 2, 3,... — 1 and application of Lemma 1 

as above, we find that 


Pm ( max \Zf < r?on ) < TT 


Pn 


1-2$ -riein/\jT‘n,i 


Note that 


which implies that 


n ^ ^ ii2^ S(g„ + l)n^ 

f^On > - ;—r Win - POnIb 7-]-^ 

9n H" 1 -^max Qn Pn 


rjOn < 


I 2Aniax^^ logPr, 


<5A„ 


The eigenvalues of Hn bounded below by Amin/Amax: so > Amin/Amax by 

Lemma 8. Then it follows that 


Pm ( max \Zf <T]on] < 
yi<i<p„ J 


1 - 2 $ 


/2A^axe^logp„ 


^2. 


< exp 


-2p„$ 


/2Aiaxe^ logPn 


^A2,„ 


Notice that if any subsequence of Pn is bounded above, then the quantity 

$[-(2ALxe"logPn/aL„)'/"] 


is bounded away from zero along that subsequence, and thus posterior inconsistency 
follows immediately. So we may instead assume that Pn —>■ oo. Then 

2AmaxS^ logPn/-)> OO, 


in which case the inequality 

1 - $(0 > (t"^ - i“^)(27r)-i/2 exp(-iV2) > {2t)-\2TT)-^/^ exp{-f/2) 
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for large t may be applied for all sufficiently large n, yielding 


M 


( max \Zi\ > rjon] > 1 - exp 

yl<2<Pri / 


-Pn 




47rA^a^e2 \ogp„ 


exp 




= 1 — exp < — < 


1 for e < 




47rA^a^e2 logp„ 


exp 


SX'L^r- 


\2 2 
^ _ ^max^' 


SX'L^r- 


logp« 


Isxi,^ 


XL 


Therefore posterior consistency does not occur. 

Proof (Proof of Lemma 10). Define S := liminf„^oo ||7„ — /3on||2i o,nd assume 6 > 0. 
Then 


Tn 


T„ 


||7n-/3o«|l2 ] > ^{<^0 + 




Pn ^On V PnXon J ^On V 2Aniax J 4A„ 

for all sufficiently large n a.s.{Po), since Tn/Oon 1 u.s.^Pq) by Lemma f. Then 


lim inf g^^ > lim inf 


n — pn + a — 2 
Sn + b 


4A„ 


- 1 


4Aniax<^r 


> 0 a.s.{Po) 


since {n — Pn + a — 2)/{Sn + b) ^ 1/aQ a.s.(Po) by Lemma 3. 

Proof (Proof of Theorem 2). By Theorem 1, we immediately have that posterior con¬ 
sistency occurs if and only if both 


llTn /^On 


ITe now consider three cases. 


0 and a.s.(Po). (9) 


{gL^ + irn' 


Case 1: Suppose there do not exist a subsequence kn of n and a constant A > 0 such 
that ||7fc„ — /Sofcnili ^ Il7fcn ~ /3ofc„||oo ^ 0. Now let kn be a subsequence of n, 
and consider two sub-cases. 


Case 1.1: Suppose ||7fc„ —/3on||oo —0. Then clearly the first condition in (9) is 
satisfied trivially. Note that for any further subsequence run of kn for which ||7m„ — 
/^Orn^lli 0) second condition in (9) is satisfied trivially as well, so we may instead 
assume liminf„_^oo ||7fe„ ~ /Sofcnili > 9. Then for all sufficiently large n u.s.^Pq), 


< 


< 


g^^logpk„ 


EB 

kn 


(3, 

log k„ 
kn 

log k„ 


1)2 fe 
Sk 


I l 7 fe„ - f^ok, 

b 


kn- Pk„+a-2 
Sk„+b 

kn - Pk„ + a - 2 



PkJhkr, - /3ofc„||i 


dok„ 


Pkn An 


0 a.s.{Po) 


( 10 ) 


by Lemmas 3, f, and 10. Thus, both conditions in (9) hold along the subsequence kn. 
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Case 1.2: Note that Case 1.1 can he applied to any further subsequence rrin of kn for 
which || 7 m„ — /3om„||oo —>■ 0, so we may suppose for Case 1.2 that liminf„_>oo || 7 fe„ ~ 
/3ofc„||oo > 0. Note also that in this case, there cannot exist any further subsequence mn 
of kn for which || 7 m„ ~/ 3 om „||2 converges to a nonzero constant, since this would con¬ 
tradict the original supposition of Case 1. Then since liminf„^oo | | 7 fc„ — /Sofcnili — 
liminf„_j.oo ||7fcr. “ f^okJllc > Oi follows that || 7 fc^ - f3okJ\2 oo. Then for all 
sufficiently large n a.s.{Po), 


1 




Il7fe„ - /3ofe„ 


< 


Sk„ + b \ / dokn 

kn- Pkr,+a- 2 J \Tk„ 

^k„ b \ f 


kn - Pfe„ -ha-2/ \Tk. 


Pk„ I hk„ - /3ofc„ 

^Ofc„ 

Pkn -^max 

kn I |7fen f^Qk„ I 


( 11 ) 


—?► 0 a.s.{Po) 


by Lemmas 3, f, and 10, while (10) also holds by the same lemmas. Thus, both conditions 
hold along the subsequence kn. Since Cases 1.1 and 1.2 together establish that both 
conditions hold along any subsequence kn, they hold for the whole sequence, and therefore 
posterior consistency occurs. 

Case 2: Now suppose there exist a subsequence kn ofn and a constant A > 0 such that 
Il7fcn “/^ofenlli —i" ^ > 0 and ||7fc„ —/3ofe„||oo ^ 0, and suppose a = 0. Note that Case 1.1 
can be applied to any further subsequence mn of kn for which ||7m„ — /9om„ ||oo —>■ 0 , so 
we may suppose for Case 2 that liminf„_>.oo || 7 fc„ — /9ofc„||oo > 0. Then (10) and (11) 
still hold by Lemmas 3, f, and 10 since Pk^/^n 0 and liminf„_^oo || 7 fc„ ~ /3ofc„||2 > 
liminf„_>oo || 7 fc„ ~ /3ofe„||oo > 0. Hence, the two conditions hold for every subsequence, 
and consequently for the overall sequence. Therefore posterior consistency occurs. 

Case 3: Now suppose there exist a subsequence kn of n and a constant ^ > 0 such 
that \ \jkn -f^okjll ^ > 0 and || 7 fc„ -/3ofc„||oo 0, but suppose a > 0. in Case 2, 

we may suppose for Case 3 that liminf„_>.oo || 7 fc„ — /3ofe„||oo > 0. Then for all sufficiently 
large n a.s.{Po), 


I |7A;n /^Ofcnlloo / ^kn "kb \ / \ f Pkn 117^^1 | | 

3f„^+l \kn-Pkn+0.-‘^j \Tkn J \ (^Okn 

Sk„ +6 \ / Opkn \ I min 

kn-Pkn+a- 2 ) \Tkn J Vfc„||7fc„ -/3ofeJl2 

X liminf || 7 fc„ - /3ofc„lloo 

n—>-oo 

-2 \ , 

° liminf || 7 fc„ -/3ofc„||oo > 0 a.s.{Pp) 

n—^oo 

by Lemmas 3, f, and 10. The first condition fails for the subsequence kn and hence for 
the overall sequence. Therefore posterior consistency does not occur. 

Proof (Proof of Lemma 11). Assume that n~^TnEM{g‘^{g+l)~‘^ \ /3n,<S'„] — >■ 0 a.s.{Pp). 
By Lemma 2, to determine whether posterior consistency occurs, it suffices to consider 
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whether PM(||/9n — /3n||oo > e | f3n,Sn) —>■ 0 a.s.{Po) for every e > 0. By iterated 
expectation and the triangle inequality, 


Em 
— Em 


^(||/3„-/3^(g) 


> 2e 


< Em 

< Em 
+ Em 


Pm 
P, 

Pm 


f3n - /3ri 


> e 

> e 

> e 


, Pn, Sn^ 
g,(J , Pn , Sr, 
g,(7 , Pm 


Pn, S, 
Pn, S, 
Pni Sn 


’m ( Pn{g)-Pn > e/2 g,a'^,Pn,Sn) Pn,Sn 

\ oo / 

( l^n-Pnig) > e/2 g,a‘^,Pn,Sn) Pn,Sn 

V OO / 


( 12 ) 


Consider PmHIPu — Pn{g)\\oo > e \ 5, cr^,/3„, S'„) for some arbitrary e > 0 and g > 0. 
Under Pm, 

f3n-P^{g) I g,a\pr„Sr,^N^„ . 

Let Vn,ii, ■ ■ ■ ,Vn,pnp„ denote the diagonal elements of n{X//Xn)~^, and write 


Pm 

Pn 


< 


( 

£pm ( 


Pn - P^{g) 

Pn,z - Pn,i{g) 


2=1 

Pn 


> € 


> e 


9 , Sn ^ 

9 ^ ? f^n 1 S', 


<^Pm 


Pn^- Pn Ad) > ^ 

2=1 

0^2 ^4^, Q\ ^2^4 


9t O' , /37T,, Sj 




ig^(T^Vn,ii ^ SAinaxff O- 


^ (5 + I)2n2e4 - (5 + 1 ) 


Then 


Em 


Pm ( 

< E 


ne 

3Am 


Pn - P^ig) 

— tjM 


> e 


I 1 O , /3n, 




ne 


-E’m 


(ff + l)= 

2 


/3n,-S', 


g 


Em (cr^ I 5:/3», 5'n^ 


Pn, Sn 


Pn, Sn 


ig + ir 

Observe from the form of the posterior in (1) that under Pm, 

n + a — 2 Sn + b + (gn + 1) ^T„ 


I g,Pn,Sn ^ InverseGamma 
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Therefore, 


Em 


Pm{ tdu-^nig) >e ff,Cr^/9n,S'r^) 

\ oo / 


3A„ 


< 


ne^ 

6 Ainj 


-Em 


['S'n + 6 + {gn + 1) ^Tn\ 

{g + l)2(n + a — 4)(n + a — 6) 


/3n, Sn 




ne^ 


+ 


6An 


rj^2 


Em 


n + a —6/ ne^ (n + a — 6)^ L (5 + 1) 


/^n ; S',, 


0 a.s.(Po) 


by Lemma 3 and the initial assumption. Then this result and the inequalities in (12) 
imply that posterior consistency occurs if and only if 


Em 


Pa, 


'm ( $nig)-0n >e g,a'^,0n,Sn) 0n,Sn 0 a.s.{Po) 

\ oo / J 


for every e > 0. Since 0n{g) — /3n = (5 + 1) ^( 7 ^ — 0n), we may equivalently state that 
posterior consistency occurs if and only if Pniig + 1) ^Wln- /3n||oo > e I /3n, 5'„) 

0 a.s.(Po) for every e > 0. But again by the triangle inequality. 


Pm 

< Pm 

< Pm 


1 


5 + 1 


Win /3on 


1 


5 + 1 
1 

5 + 1 


In 0ri 


> 2e 


> e 


/3n, Sn ) — Pm 

0n,Sr 


1 


5 + 1 


fdon fdr. 


> e 


$n, Sr 


(13) 


Win /3on|loo + 


/3n, Sr 


Pm 


.5 + 1 

For any arbitrary e > 0, 
1 


/^On (dr, 


> e/2 


/3„,5'r 


p, 


M 


5 + 1 


fdon fdn 


> e 


A 


i, Sn^ < Pm ( 


= I\ 


(don — /3; 

(don — 


> e 


$n, Sn'j 


> e) —>■ 0 a.s.(Po) 


by Lemma 1, where /(•) denotes the usual indicator function. Then this result and (13) 
together imply that posterior consistency occurs if and only if Pm [(5 + l)~^||7ra — 
/3on 1100 > e I /§„, S'™] -)■ 0 a.s.{Po) for every e > 0. 

Proof (Proof of Lemma 12). From the form of the posterior in (3) and the transfor¬ 
mation in (4-), 


T2 

" T? 


lig + ^r 


T2 

< -^Em 


(5 + 1)^ 


$n, S, 

(in,S, 
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— (n+a —2)/2 


dg 


pOO 

Tn {9 + l)("-P"+“-'=-6)/2 [(g + 1)(^„ + 6) + Tn\ 

_ Jo _ 

poo 

{g+ l)("-Pn+a-c-2)/2 ^ ^ ^ y^p(n+a-2)/2 

Jo 

(Sn + bf [ „('‘-Pn+a-c-6)/2p _ y)(p„+c)/2 

_ _ JWrv _ 

^3 f ^(n-p„+a-c-2)/2p _ ^^(p„+c-4)/2 
dWn 

Now let Hn Beta{{n — pn + a — c — 4)/2, (p„ + c — 2)/2) and Hn ~ Beta{{n — Pn + a — 
c)/2, (p„ + c — 2)/2) with both independent of fdn and Sn under Pm, und observe that 
Hn is stochastically smaller than Hn under Pm- Also let r(-) denote the usual gamma 
function. Continuing, we have that 


T2 


9 


„3 1)4 


f^n 5 Sn 


[Sn + bf r { ^-Pr^+p—^ ) r (2^^) Pm (Hn > Wn | /3„, Sn) 


^3 ^ ( n-p„+a-c ) p (Pi^) (h^ > 11.^ I 0 ^^ Sn) 


{Pn +c){pn +C - 2) 


{n — Pn + a — c — 2)(n — Pn + a — c — A) 


0 a.s.(Po) 


^ 1 f Sn + b 
~ n \ n 

by Lemma 3. 

Proof (Proof of Lemma 13). Assume liminf„^oo Win — /Sonlli ^ ^ some S > 0. 
Then 


lim sup Wn 

n-f-co 

= lim sup ( 1 + 


Tn 


V Sn+b 
2 


< 1 + lim inf 

\ n—J-oo 


PnCrl+nXg^ Win - /3on|l2 


n-pn 


lim inf 


(n - Pn) Tn 


-1 


< 1 


a 


n^oc l{Sn+b)9on\J 

\ (1 ~ Q^)'^maxO’Q ru \ 

= —T —1 - 2 — < 1 - a a-s.{Po) 

/ 0 + Amaxf^o 


1 a Aniax(l 
by Lemmas 3 and 4- 

Proof (Proof of Lemma 14). Assume Win — /3on|l2 oOi ^6^ e > 0. Then 

-1 


Wn= I 


T 

-L r). 


Sn+b 


(1 , PnCTo Il7n 

— /3on 2 

’ [n - Pn) Tn ' 

1 " ' n-pn 

. {Sn + b) 9on_ 


0 a.s.(Po) 
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since the term in square brackets converges to l/crg a.s.{Po) by Lemmas 3 and 4- This 
establishes (i). Now write 


e \hn- l3on\\^iSn+b) 

c -1 


||7ra “ ^Onlloo V Il7ra ~ /3on|loo 


= 1 + 


e Tn 


and observe that 


1 + 


e Tn 


Win f^OnWoo . 

e (pnCrl + nAgJ Win - l^0nWV\ 

1 H-- - 

(n — Pn) I \ln ~ /3on| loo 


-1 


(n - Pn) Tn 


< 1 


■||7n-/90' 


'nil 2 




(n - Pn) Tn 
{Sn + b) 0On 


_ {Sn + b) 0Qn_ 

-1 

—>■ 0 a.s.(Po) 


since, once again, the term in square brackets converges to l/cr^ a.s.{Po) by Lemmas 3 
and 4- It then follows immediately that Ln{e) —>■ 0 a.s.{Po), establishing (ii). 

Proof (Proof of Lemma 15). Assume that ||7n —/Sonlli —>■ ^ > 0 anrf liminf „_).oo ||7n — 
/3on||oo > 0. Let e > 0. Then limsup„^^ < (1 - Q;)Ai„axa’o/(^ + AmaxCo) < 
1 a.sfPo) by Lemma 13, and 


limsupL„(e) = limsup ( 1 + 


e Tn 


n— J-CX3 V ll7n-/90n|L('S'n + 6 ) 


= lim sup 1 + 


< limsup 

n—^oo \ 


(p«crg + nAo„^ ||7„ - (donWt) 
{n — Pn)\ I7ra ~ /3on| loo 
(n - Pn) Tn 


-1 


{Sn + b) 0Qn 


(n - Pn) Tn 
_ {Sn + b) 0On 
-1 


= 1 + 


eVC4 


\ 2 

^maxO’o r n \ 

< 1 a.s.{Po) 


Al/2e + Amax^o 


Amax^^O / 

since the term in square brackets converges to I/ctq a.s.{Po) by Lemmas 3 and 4- Define 

T*( \ f {^ ~ Oj)-^maxO'g AniaxCg ^ 

L (e) = max —---- 5 —, rx-;- 5 - > < 1, 

\ A+ AniaxCg -(- AmaxO'o J 

and observe f/iaf limsup„_^oo L„(e) < L*{e) a.s.{Po). This establishes (i). 

Now define A := liminf„_>.oo || 7 n — /3on||oo > 0, and note that 


lim inf Ln{e) 

n—^oo 


> lim inf I 1 + 


e Tn 


||7ra /3on|loo 
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> 


1 + lim sup 


(pnCTp 


■ Win ^On 


(n-pri) Win -/3or! 


lim sup 

n—¥oc 


{n - Pn) Tn 
_ {Sn + b) Oqu 


-1 


which implies that 


lim inf Ln{e) > 

n—¥C!0 



C (o^CTo \ 

(1 - a)Aa^ ) 


a.s.{Po) 


by Lemmas 3 and 4- Then it can be seen that for any C < 1, there exists > 0 such 
that Vnnmin^ao Ln{e() > C a-5.(Po)i establishing (ii). 

Proof (Proof of Lemma 16). Let e > 0. Note that E{Zn) = an/(a„ + bn) 1 — a, 
and thus |ara/(an + bn) — (1 — a)| < e/2 for all sufficiently large n. Also note that 
VaifZn) = a„ 6 „/[(a„ + 5„)^(an + bn + 1)] < l/a„ < 2/[n(l - a)] for all sufficiently 
large n. Then for all sufficiently large n, 


P {1 — a — e < Zn < i — a + e) 
(2 71 


= P 1-0- 


fln + bn 


— e < Zn — 


< 1 — a — 


fln + bn 


>p[-l<Zn - - 

2 - ” an + 


< ) >1-j Var{Zn) > 1 - 


n(l — a)e^ 


1 . 


where the second of the three inequalities is Chebyshev’s inequality. 

Proof (Proof of Theorem 3). By Lemmas 11 and 12, posterior consistency occurs if 
and only if PM[ig + i-)~^Wln-f^OnWoc > e | $n,Sn] 0 a.s.(Po) for every e > 0 , which 
by (5) occurs if and only if PmWWu < Un < L„(e) | 0n,Sn]/PMiUn > Wn \ /3„,S'„) 

0 a.s.{Po) for every e > 0. We now consider the same three cases as in the proof of 
Theorem 2. 


Case 1: Suppose there do not exist a subsequence kn ofn and a constant A > 0 such 
that || 7 fc„ — / 3 ofc „||2 ^ ond || 7 fc„ — /3ofc„||oo 0. Let kn be a subsequence of n, and 

let e > 0. Now consider two sub-cases. 

Case 1.1: Suppose || 7 fe„ - /3ofc„||oo 0. Then e“^|| 7 fc„ - /3ofe„||oo < 1 for all suf¬ 
ficiently large n. This implies that Lk„{e) = Wk^ for all sufficiently large n a.s.{Pf), 
and thereforePM[Wk„ < Uk„ < Lkfie) \ /3„,/S'„] = 0 for all sufficiently large n a.s.(Po). 
Also, PM{Uk„ > Wfc„ I $k„,Sk„) > 0 for all n a.s.{Po) since Wk„ < 1 for all n a.s.{Po). 
Thus, 


Pm 


Wk„ <Uk^ 


PM(i7fc„ > Wk„ I f3n,Sn) 


—>■ 0 a.s.(Po) 


by the combination of our results for its numerator and denominator. 

Case 1.2: Note that Case 1.1 can he applied to any further subsequence mn 
of kn for which || 7 m„ — /3om„||oo —>■ 0, so we may suppose for Case 1.2 that 
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lim infn_).oo || 7 fc„— /3ofc„||oo > 0. Note also that in this case, there cannot exist any further 
subsequence of kn for which || 7 m„ ~/ 3 om „||2 converges to a nonzero constant, since 
this would contradict the original supposition of Case 1. Then since lim inf„_>.oo || 7 fc„ ~ 
/3ofc„||i > liminf„^.oo || 7 fe„ -/3ofe„||L > 0, it follows that \\'yk„ -/3ofc„||i oo. Then 
Lemma 14 implies that both Wk„ —>■ 0 a.sfPo) and L„(e) —>■ 0 a.sfPo), which in 
turn implies that both Wk„ < (1 — a)/2 and Lfc^(e) < (1 — a )/2 for all sufficiently 
large n a.sfPo). Then for all sufficiently large n a.sfPo), 


Pm 

Wk„ 

< Uk„ < LkS^) 

$k„,Sk„ 


Pm 

[Uk„ > Wk„ 1 /3fc„ 



Pm 

Pi ^ ^ “ 

0k„ , Sk„ 

^Ck„ < 2 

Pm 

Pi. ^ ^ ~ 

$k„ , Sk„ 

^Ck„ > 2 


a.sfPo) 


by Lemma 16. Finally, since Cases 1.1 and 1.2 together establish that the relevant con¬ 
dition holds along any subsequence kn, it holds for the whole sequence, and therefore 
posterior consistency occurs. 

Case 2: Now suppose there exist a subsequence kn of n and a constant ^ > 0 such 
that || 7 fc„ — /Sofcnili —i" ^ > 0 and || 7 fe„ — fink^Woo 0, and suppose a = 0. Note that 
Case 1 can be applied to any subsequence mn of n for which either || 7 m„ — / 3 om „||2 
does not converge to any nonzero constant or ||7m„ — /3om„||oo —>■ 0, so it suffices 
to show that the relevant condition holds along the subsequence kn. Note also that 
this means we may suppose for Case 2 that liminf„^oo ||7fc„ ~ /3ofe„||oo > 0. Now let 
e > 0. By Lemma 13, limsup„^(^ < AniaxO'Q/(A + AinaxO’Q) a.s.(Po), which implies 
that < 2AmaxO'Q/(A + 2AinaxO’o) for all sufficiently large n a.s.{Po). Moreover, by 
Lemma 15, there exists L*{e) < 1 such that limsup„_j,(^ Lk„{e) < L*(e) a.s.{Po), which 
implies that Lk.„{e) < [1 + L*(e)]/2 for all sufficiently large n a.s.{Po). Then for all 
sufficiently large n a.s.(Po); 


P, 


M 


Wk„ <Uk„ <Lk„{e) I /3fc„,5fe„ 


Pm 


PM[Uk„>Wk„ I 0k„,Sk„) 
1 + L*{e) 


< 


Uk,, < 


Pk„,Sk„ 


Pm ( C^fc„ > 


2 Amaxf^O 
^ 2Aniax^o 


^k„,Sk„ 


0 a.s.{Po) 


by Lemma 16. Therefore posterior consistency occurs. 

Case 3: Now suppose there exist a subsequence kn of n and a constant ^ > 0 such 
that || 7 fe„ —/Sofcnili —>■ ^ > 0 and || 7 fc„ —/3ofc„||oo ^ 0, but suppose a > 0. By Lemma 13, 
limsup^^oo < (1 - a)AinaxO-o/(^ + AmaxO-o) a.s.{Po), which implies that Wk„ < 
2(1 — Q:)AinaxO'o/(^ + 2 Ainax< 7 g) for all Sufficiently large n a.s.{Po). By Lemma 15, there 
exists ei_a /4 > 0 such that liniinf„_).oo (£ 1 - 0 / 4 ) > 1 — a/4 a. 5 .(Po )7 which implies 
that Lk„{ei-al 4 :) > 1 — q;/2 for all sufficiently large n a.s.(Po). Then for all sufficiently 
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large n a.s.{Po), 


Pm 

^ (^1 —CA/ 4 ) 


Pm 

(Uk„ > 1 /3fc„,5 

kf 



> Pm 


> Pm 


^ ^ ^kni^l — a/i) | /3fe„ J ‘S'fe„ 


2(1 O;)AniaxC^0 


+ 2A„,axa2 < 1 2 


0k„,Sk„ 


1 O.S.(Po) 


by Lemma 16. Since the relevant condition fails to hold for the subsequence kn, it fails 
to hold for the overall sequence. Therefore posterior consistency does not occur. 

Proof (Proof of Lemma 17). Consider two cases. 

Case 1: Suppose || 7 „ - /3on||i -5^ 0. Then 0on = PnC^o + nXonlhn - f3on\\l < na^ for 
all sufficiently large n. This result and (8) imply that 


(/^4)q (Tn) Eq 




< + 192cTg6»o„ < 96riV^ 


for all sufficiently large n. Then there exists N such that 

oo oo oo 2 8 

^ Po (r„ > 2nal) <J2 Po (|T„ - 0on| > na^^) < ^ 

TL (T n 

n=iV n=iV n=iV ^ 


E 

=N 


< OO 


by Markov’s inequality applied to (Tn — ^On)^, which in turn implies by the Borel-Cantelli 
lemma that \imsup^^^^{Tn/n) < 2crQ a.s.(Po)- Therefore, n~^ T^ EMlg'^ig + \ 

Sn] < n-^ T^ ^ 0 a.s.{Po). 


Case 2: Note immediately that Case 1 can be applied to any subsequence kn of n for 
which ||7n —/Sonlli 0) so we may suppose for Case 2 that liminf„_>oo ||7n —/9ora||| > 0. 
Then limsup„^^ Wn <l — aby Lemma 13. Define tpniu) := /(i 4 /„,i)(u) exp[—nWn(l— 
u)/2{u— W„)], where I denotes the usual indicator function, and note that this is a 
nondecreasing function of u on the interval (0,1). Using the form of the posterior in (6) 
and the transformation in (f), we may write 


T2 

n 


(5 + 1)' 


f^n 1 


rjiZ 


-(n+a-2)/2 ^ 


[(5 + l)('S'n + b) +T„ 


25 


dg 


(^ + l)("-Pn+a-2)/2 _ ^ 

exp^--J dg 


poo 

— 

[(5 + l)(5'n + 5) + Tn] 

{Sn+b)^ [ y(’^-Pn+“-=-10)/2(l_y)(Pn+4)/2 

Jo 


U-Wn 


nl/2 


Wn{l-U) 


fniu) du 


n^Tl / y(’^-Pn+a-c-2)/2(^_y)(p„-4)/2 


'--Wn 


1 - 3/2 


Lw„(i-u) 


fn(u) du 
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{Sn + hf r ^(n-p„+a-c-9)/2(^ _ ^)(p„+3)/2 

^ _ Jo _ 

n^l-Wnf f 

Jo 


du 


(Pn-l)/2 


tpniu) du 


Now let hn and hn denote the densities with respect to Lebesgue measure of Beta{{n — 
Pn + a — 7)/2, {pn + 5)/2) and Beta{{n — pn + a — 3)/2, (p„ + l)/2) random variables, 
respectively. Then we may continue by writing 


-Em 


9 


ig + ^y 


fJn ; Sn 


{Sn + bf r 


< 


n - pn + a - 7\ ^ (Pn + h 


hn{u) tpniu) du 


'(1 - WnY r 


2 T.f 'ri-Pn -ha- 5\ fpn-\-l 


< 3 

/foO 


{Sn + 6)^ {Pn + 3)(pn + 1) 


n3(l - WnY (n-pn-h a - 5)(n -p„ + a - 7) 


hn(u) lfn{u) du 


0 a.s.(Po)- 


Note that the last inequality holds because a random variable with density h„ is stochas¬ 
tically smaller than a random variable with density hn and because xfn is nondecreasing 
on (0,1), while the almost sure convergence to zero is by Lemma 3 and the fact that 
limsupjj_^oo Wn < 1 — a < 1 a.s.(Po) by Lemma 13. 

Proof (Proof of Lemma 18). Note immediately that both (i) and (ii) are trivial iff = 0 
or f > 1, so assume 0 < f < 1. Next, by Stirling’s approximation, we may bound the 
normalizing constant by 


log 


T{an+bn) {an+bnr-+^--^/^ 

- ^ log .-1/2 ,..-1/2 


r(an)r(6„) Q,“" i)[ 

for all sufficiently large n. We may rewrite this as 

r(a 


log 


r(a„)r(6„) 


^n) ^ 1 f On P 1^71 

< On log 


bn log 


bn. N 1 . / Un.bn 

+ o log 


for all sufficiently large n. Then 
P{Zn < f) 


f 


< 


r(^n -h bn) 
T{an)T{bn) Jo 
r(^n -h bn) 

r(a„)r(6„) Jo 


Jl-z) 


bn-1 


dz 


^ ..an-l ^ 1"(®" + 


r(^n)r(^n) On 


< exp 


an log f - log an + On log 


+ bn log 


On P bn 


o log > „ 

A \ Q/og 
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for all sufficiently large n. Now observe that 
log P{Zn < 0 


n 

< — log^ - - loga„ + — log 
n n n 


bn 1 

— log , , 

n \ bn 


(1 — a) log ^ — (1 — a) log(l — a) — a log a if a > 0, 
log^ ifa = 0. 


5“ i , h 

in \ On + bn 


If a > 0, then (1 —a) log(l —a)+a log a > —log2, and i/iws limsupjj_^oo log P(Z„ < 
f) < (1 —a) log^+log2. Then n~^ logP(Z„ < ^) < (1 —a) logf+logA for all sufficiently 
large n, which implies (i). If instead a = 0, then limsup^^Q^ n~^ logP(Z'„ < f) < log^, 
so n~^ log P{Zn < f) < f logc^ for all sufficiently large n (noting that log^ < Oj. This 
implies (ii). 

Proof (Proof of Lemma 19). Let 5 = liminf„_>oo ||7n~/3on|l2 > 0- Then by Lemma 13, 
limsup„_^^H4 < (1 — a)AmaxO’g/((5 + Amaxcrg) a.sfPo), which implies that Wn < 
2(1 — Q:)AniaxO'o/((5 + 2AmaxO'o) <1 — 0 for all Sufficiently large n a.s.(Po) • Then for all 
sufficiently large n a.sfPa), 


Rn P I fniu) . , 

J{l-a+Wn,)/2 


- 3/2 


exp 


> exp — 


nWn. 


fn{u) 


nWn{l - u) 

' 2{u - Wn) 

1 - 3/2 


du 


I — a — Wn J J(^l-a+Wn )/2 L(l~^) 


du 


^ 'n-pn + a-3\ ^ f Pn + 1 


^ n-pn + a \ ^ I Pn-‘^ 


X P, 


M 


2 y V 2 

1 — a+ Wj 


<Un<l 


exp 


$n,Sn 


nWn 


1 — O — Wn 


^ 'n-p^ + a-3\ ^ (pn + I 


> 


^ n-pnPa \ Pn-2 


X Pm 


2 / V 2 

d ~t“ 4Amaxf^o 
2J + 4Aniax<^0 


exp - 


nWn 


1 — a — Wn 


(l-a)<Un<l 


(14) 


whereUn ^ Beta{(n — pn + a—'i)/2,(pnP\)/‘2), independent of f3n and Sn, under Pm- 
For all sufficiently large n, Stirling’s approximation yields that 
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n-pn + a-3\ fpn + l 


n - Pn + a\ ( Pn - 2 


> 


I o\ {n—p„+a—4)/2 / „ 

n — Pn + a — ( n — Pn + a — 6 


exp - 


n-pn + a 


{n-pn+a-l)/2 


Pn + 1 


Pn/2 


exp 


exp 
Pn + 1 


n - p„ + a 


Pn 2 


(Pn-3)/2 


exp 


Pn 2 


1 fn — pn + a — 3 

2 


(n-p„+a-4)/2 


n-pn +a 

Then for all sufficiently large n, 

n - Pn + a - 3\ (Pn + I 


/ Pn + 1 / Pn + 1 \ 

\Pn-2j \n-pn + aj 


s /2 


n-pn + a\ ( Pn- 2 


> 


1 / pn + 1 

Vn - Pn + a/ 


4 \n- Pn + a 


Now observe that 


P, 


M 


b ~\~ 4Aniax^0 


25 + 4Amax<^0 


by Lemma 16, which implies that 
Pm 


5 4Aniaxf^0 


25 4Amaxf^o 


{l-a)<Un<l 


il-a)<Un<l 


> 


(15) 


(16) 


for all sufficiently large n. Then by combining Inequalities 14, 15, and 16, we have that 
for all sufficiently large n a.sfPo), 


Rn > (4n)-3/2 


exp 


nWn 


1 — a — W„ 


= exp < —n 


Wn 


1-a-Wn 


-log(4n) 


Finally, take K = 2limsup„^(^[W„/(l — a — Wn)]- Observe that K < oo a.s.{Po) due 
to the fact that limsup„_^oc Wn < (1 — 0 i)\nis.^<yQ/(5 + AmaxCQ) < 1 — a a.s.{Po). Then 
Rn > exp{—nK) for all sufficiently large n a.s.{Po). 

Proof (Proof of Lemma 20). Let e > 0, and assume ||7n — /SonHl oo. Then by 
Lemma If, Wn —>■ 0 a.s.{Po) and Ln{e) —>■ 0 a.s.(Po)- Next, observe that the last two 
terms of the integrand in Qn{P) comprise an unnormalized InverseGamma(l/2, nWn/2) 
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density in {u—Wn)/{i — u), the mode of which occurs atnWn/'i- Then for all sufficiently 
large n, 


rLn{e) 

Qn(e) < / fn{u) 

JWn 


nW„ 


-3/2 


exp du<2 {nWn) J fn{u) du 




by Lemma 18. Now note that if e“^||7n — /3on||oo < 1, then Ln{e) = Wm in which case 
Qn(e) = 0 and the result is trivial. So instead assume that e“^||7n — /3on||oo > 1, which 
in turn implies that Ln{e) < e“^||7n — (SonWooWn. Then 


< 2^”+^ I 1 + 


^-^\hn-f3on\\o.{Sn+b) 

3/2 


e ^Wln — f^OnWooiSn + b) + Tn 


(fPn^O IIT^" ^0n|l2^ 


n-pn 


(n - Pn) Tn 
{Sn + b) Oon 


X 1 + 


+ nXo^ ||7„ - /Jonlla) 

{n-Pn)\hn - /Sonlloo 


i(l —ck) 


(n - pn) Tn 


3/2 


< 


22n+l ^-3/2 I 4 ||7ra /3on|l2 j / e||7ra /3o- 


. (Sn + b) 0On. 

V —n(l—a) 


'’^112 


(1 - a)Xon(7Q 


2(1 Q:)Aon^o 


= 2"(3-a)+4(^^)-3/2 11-„(l-„)+3 


for all sufficiently large n a.s.{Po) by Lemmas 3 and 4 since the quantity in square 
brackets converges to l/ug a.s.(Po)- Now continue by writing that for all sufficiently 
large n a.s.(Po)i 


Qn{e) 


< 


exp|-n ^1 - a - log (|| 7 „ -/3on|l2) - - a - log (e ^(1 - a)AmaxCTo) 


2n 


+ tt log(”«) - 3 - a + - log 2 


= exp [-nKn{e)] 


where Kn(e) oo is defined to be the quantity in square brackets. 

Proof (Proof of Lemma 21). Assume ||7n—/Sonlll —>■ ^ > 0, liminf„_>.oo | |7n —/3on| |oo > 
0, and a = 0. Let e > 0. Note that Rn > 0 for all n a.s.(Po) since Wn < 1 for 
all n a.s.{Po). Then whenever Ln{e) < Wn, we immediately have that Qn{^)/Tln = 0 
exactly, so we may instead assume that Ln{e) > Wn for all n. By Lemma 15, there exists 
L*{e) < 1 such that Mmsupn^oo ^nie) < L*{e) a.s.{Po), which implies that Ln{e) < [1 + 
L*(e)]/2 for all sufficiently large n a.s.{Pif). Then we may write that for all sufficiently 
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large n a.s.{Po), 


Rn > 


' [l+L„(e)]/2 


fn{u) 


.(1- W). 


-3/2 


exp 


nWn{l - u) 

' 2{u - Wn) _ 


> exp < — 


> 


nW^ [1 - L„(e)] 


2 [1 + Ln{e) — 2Wn] j 7[3+L*(e)]/4 

n-pn + a-3\^/'pn + l 


fn(u) 


du 

1 -3/2 


(1-m) 


du 


n-pn + a\^ f Pn-2 


X Pi 


M 


3 + L*(e) 


<Un <1 


exp< - 


$n, Sr 


nWn [1 - Lrrie)] 

4[Ln{e)-Wr,] 


> (4n)-3/2 

^ I 4[L„(e)-iy„] J 

by Inequalities 15 and 16. Next, write Qn{^) o,s 


rLn{e) 

Qn{e) = / fn{u) 

Jw„ 


U-Wr, 


-3/2 


exp 


" L 2(w-W„)J 


Wnil - u) 


exp 


{n - l)Wr,{l - u) 
2{u-Wn) 


(17) 


du. 


The second and third terms of the integrand comprise an unnormalized 

InverseGamma{l/2, WnjT) 

density in [u — Wn)/(}. — u), which has mode Wnf3. Then 


Wr, 


-sll 


Qn(e) < — exp -- exp - 


(n-l)W^„[l-L„(e)]\ 


2 [Lrr(e) - Wrr] j Jq 


fn{u) du, 


by Lemma 18. Then this result and Inequality 17 together yield that for all sufficiently 
large n a.s.{Po), 


Qrrie) ^ 


Rn ^ V 2n 


[L„(e)]"'/%xp<; - 


nWn [1 - Lnie)] (n-l 1 


< 


Now observe that 


2n 

Wn. 


3/2 


exp 


2[Ln{e)-Wr,] \ n 

nWn [1 - L*{e )]' 

16 


( 18 ) 


lim inf Wn 

n—foo 

= liminf ( 1 + 


ri-J-oo \ Sn + b 
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= lim inf 

n—¥oo 


[, , PnO-Q+nAo„i ||7„ 

- /3on 2 

(n - Pn) Tn 

1" ' n-pn 

_ {Sn + b) 0o„_ 


> (l + ^/AininCTQ) — 


A 


min<^0 


A. + Amin^^o 


a.s.(Po), 


which implies that Wn > AininO'Q/(2A +AminO'Q) for all sufficiently large n a.s.(Po)- We 
may combine this with Inequality 18 to yield that for all sufficiently large n a.s.(Po), 


Qn(e) 


2n{2A + AminC^o 




'min^O 


3/2 


exp 


nXrai-gCrl [1 - P^(£)] 

16 ( 2 ^ + AniinCp) 


—7> 0 a.sfPo) 


since L*{e) < 1. 

Proof (Proof of Theorem 4). By Lemmas 11 and 17, posterior consistency occurs if 
-Pm[( 5+l)"^ll7«-/3on||oo > e I /3„, Sn] —>■ 0 a.s.(P q) for every e > 0. Then by (7), this 
occurs if Qnie)!Rn —>■ 0 a.s.(Po) for every e > 0. We now proeeed aecording to cases 
similar to those in the proofs of the previous theorems. 

Case 1: Suppose there do not exist a subsequence kn of n and a constant A > 0 such 
that || 7 fc„ — /Sofcnili ^ o,nd || 7 fc„ — /3ofc„||oo ^ 0. Let kn be a subsequence of n, and 
let e > 0. Now consider two sub-cases. 

Case 1.1: Suppose \\'yk„—f3Qn\\oo —>-0. Then e~^\\'jkr,~l3ok„\\oo < 1 for all sufficiently 
large n a.s.{Po). This implies that Pfc„(e) = Wk„ and Qk^i^) = 0 for all sufficiently 
large n a.s.(Po). Also, Rk„ > 0 for all n a.s.(Po) since < 1 a.s.(Po). Therefore, 
Qkn.{^)/Rkn 0 o.s.(Po). 

Case 1.2: Note that Case 1.1 can be applied to any further subsequence to„ of kn for 
which || 7 m„ — /3om„||oo —>■ 0, so we may suppose for Case 1.2 that liminfji_^oo || 7 fc„ ~ 
/3ofc„||oo > 0. Note also that in this case, there cannot exist any further subsequence 
of kn for which || 7 m„ ~/ 3 om „||2 converges to a nonzero constant, since this would con¬ 
tradict the original supposition of Case 1. Then since liminf„^oo || 7 fe„ — /Sofcnili ^ 
liminf„_>oo ||7fc„~/3ofc„ 11^ > O 7 it follows that \ | 7 fc„—/9ofc„ ||| 00 . Observe that by Lem¬ 

mas 19 and 20, there exist a constant K and a sequence of constants Kn(e) —> 00 such 
that Qk„{i)/Rk„ < exp{—n — K]} —>■ 0 a.s.(Po). Finally, since Cases 1.1 and 1.2 
together establish that Qk^{e) / —>■ 0 a. s.{Pq) for every subsequence kn, it follows that 

Qnief/Rn 0 a.s.(Po), and therefore posterior consistency occurs. 

Case 2: Now suppose there exist a subsequence kn of n and a constant A > 0 such 
that || 7 fc„ — /^ofcnlli —>■ ^ > 0 and ||7fc„ — /3ofc„||oo ^ 0, and suppose 0 = 0. Note that 
Case 1 can be applied to any subsequence mn of n for which either ||7m„ does 

not converge to any nonzero constant or ||7m„ — /3om„||oo 0, so it suffices to show 

that Qknir)/Rkn, —>■ 0 a.s.(Po). Note also that this means we may suppose for Case 2 
that liminf„_).oo ||7fc„ ~ /3ofc„||oo > 0. Now let e > 0. Then we immediately have that 
Qk„{^)/Rn —>■ 0 a.s.(Po) by Lemma 21. Therefore posterior consistency occurs. 












D. K. Sparks, K. Khare, and M. Ghosh 

References 


663 


Armagan, A., Dunson, D., Lee, J., Bajwa, W., and Strawn, N. (2013). “Posterior 
consistency in linear models under shrinkage priors.” Biometrika, 100(4): 1011-1018. 
628, 630 

Barron, A., Schervish, M., and Wasserman, L. (1999). “The consistency of posterior 
distributions in nonparametric problems.” Annals of Statistics, 27: 536-561. 627 

Berger, J., Ghosh, J., and Mukhopadhyay, N. (2003). “Approximations and consistency 
of Bayes factors as model dimension grows.” Journal of Statistical Planning and 
Inference, 112: 241-258. 628 

Bernstein, S. (1934). Theory of Probability. GTTl, Moscow. 631 

Bontemps, D. (2011). “Bernstein-von Mises theorems for Gaussian regression with 
increasing number of regressors.” Annals of Statistics, 39: 2557-2584. 628 

Diaconis, P. and Freedman, D. (1986). “On the consistency of Bayes estimates.” Annals 
of Statistics, 14: 1-26. 631 

Doob, J. (1948). “Application of the theory of martingales.” Collogues Internationaux 
du Centre National de la Recherche Scientifique, 13: 23-27. 631 

Fernandez, G., Ley, E., and Steel, M. (2001). “Benchmark priors for Bayesian model 
averaging.” Journal of Econometrics, 100: 381-427. 629, 634 

George, E. and Foster, D. (2000). “Calibration and empirical Bayes variable selection.” 
Biometrika, 87: 731-747. 627, 641 

Ghosal, S. (1999). “Asymptotic normality of posterior distributions in high-dimensional 
linear models.” Bernoulli, 5: 315-331. 628 

Ghosal, S., Ghosh, J., and van der Vaart, A. (2000). “Convergence rates of posterior 
distributions.” Annals of Statistics, 28: 500-531. 627 

Ghosh, J., B. Sinha, B., and Joshi, S. (1982). “Expansions for posterior probability and 
integrated Bayes risk.” In Gupta, S. and Berger, J. (eds.), Statistical Decision Theory 
and Related Topics, volume 3, 403-456. Academic Press, New York. 627, 631 

Jiang, W. (2007). “Bayesian variable selection for high-dimensional generalized linear 
models: Convergence rates of the fitted densities.” Annals of Statistics, 35: 1487-1511. 
628 

Kass, R. and Wasserman, L. (1995). “A reference Bayesian test for nested hypotheses 
and its relationship to the Schwarz criterion.” Journal of the American Statistical 
Association, 90: 928-934. 634 

Laplace, P. (1774). “Memoire sur la probabilite des causes par les evenements.” Memoire 
Academy Recherche Scientifique, 6: 621-656. 631 

LeCam, L. (1982). “On the risk of Bayes estimates.” In Gupta, S. and Berger, J. (eds.). 
Statistical Decision Theory and Related Topics, volume 3, 121-138. Academic Press, 
New York. 631 


664 


Posterior Consistency under g-Priors 


Lee, J. and Oh, H.-S. (2013). “Bayesian regression based on principal components for 
high-dimensional data.” Journal of Multivariate Analysis, 117: 175-192. 628 

Liang, F., Paulo, R., Molina, G., Clyde, M., and Berger, J. (2008). “Mixtures of g priors 
for Bayesian variable selection.” Journal of the American Statistical Association, 103: 
410-423. 627, 629, 637, 641 

Mallick, B., Ghosh, D., and Ghosh, M. (2005). “Bayesian classification of tumors using 
gene expression data.” Journal of the Royal Statistical Society B, 67: 219-234. 627 

Moreno, E., Giron, F., and Casella, G. (2010). “Consistency of objective Bayes factors 
as the model dimension grows.” Annals of Statistics, 38: 1937-1952. 628 

Shang, Z. and Clayton, K. (2011). “Consistency of Bayesian linear model selection with 
a growing number of parameters.” Journal of Statistical Planning and Inference, 141: 
3463-3474. 629 

von Mises, R. (1964). Mathematical Theory of Probability and Statistics. Academic 
Press, New York. 631 

Zellner, A. (1986). “On assessing prior distributions and Bayesian regression analysis 
with g-prior distributions.” In Goel, P. and Zellner, A. (eds.), Bayesian Inference and 
Decision Techniques, 233-243. North-Holland, Amsterdam. 627 

Zellner, A. and Slow, A. (1980). “Posterior odds ratios for selected regression hypothe¬ 
ses.” In J. Bernardo, D. L., M. DeGroot and Smith, A. (eds.), Bayesian Statistics, 
585-604. University Press, Valencia. 627, 629, 639, 641 

Zhang, Z., Jordan, M., and Yeung, D. (2009). “Posterior consistency of the Silverman g- 
prior in Bayesian model choice.” Advances in Neural Information Processing Systems, 
22. 629 

Acknowledgments 

This work was supported in part by NSF Grants DMS-1106084 and DMS-1007494. 


